A Diagram is Worth a Dozen Images
نویسندگان
چکیده
Diagrams are common tools for representing complex concepts, relationships and events, often when it would be difficult to portray the same information with natural images. Understanding natural images has been extensively studied in computer vision, while diagram understanding has received little attention. In this paper, we study the problem of diagram interpretation and reasoning, the challenging task of identifying the structure of a diagram and the semantics of its constituents and their relationships. We introduce Diagram Parse Graphs (DPG) as our representation to model the structure of diagrams. We define syntactic parsing of diagrams as learning to infer DPGs for diagrams and study semantic interpretation and reasoning of diagrams in the context of diagram question answering. We devise an LSTM-based method for syntactic parsing of diagrams and introduce a DPG-based attention model for diagram question answering. We compile a new dataset of diagrams with exhaustive annotations of constituents and relationships for over 5,000 diagrams and 15,000 questions and answers. Our results show the significance of our models for syntactic parsing and question answering in diagrams using DPGs.
منابع مشابه
تشخیص پویای پلاک خودرو مبتنی بر مورفولوژی برای تصاویر رنگی و مادون قرمز
This paper proposes to use the method of edge detection, morphology, and dynamic image thickening for license plate extraction from images. In the proposed algorithm, a different thickening is used for rear and front parts of the image; besides, to increase the segmentation rate, determination of the license plate frame using standard deviation in the vertical histogram diagram is suggested. Fu...
متن کاملHygiene as Applied to Tropical and Sub-Tropical Climates
We have read the book with great interest, and while we cannot sa}7 that it contains anything not familiar to the student of hygiene in India, yet it certainly contains much not met with in ordinary manuals of hygiene, and as such is of special value. It is admirably illustrated. The chapter on conservancy is-particularly good, and to the reader at home full of new matter, the diagrams of forms...
متن کاملDeneb's variability: a hint of a deep-lying convection zone?
During their post – main-sequence evolution, massive star models harbor a convection zone on top of their hydrogen burning shell. This physical property helps, under suitable circumstances, to destabilize low-degree nonradial modes that might explain the cyclic light variability of α Cygni variables. The results of first exploratory computations demonstrate that modes with periods ranging from ...
متن کاملSegmentation of Magnetic Resonance Brain Images using Analog Constraint Satisfaction Neural Networks
The Grey-White Decision Network (GWDN) is presented as an analog constraint satisfaction neural network that segments magnetic resonance brain images. Constraints on signal intensity, neighborhood interactions and edge in uences are combined to assign labels of grey matter, white matter or \other" to each pixel. An improved version of this novel segmentation network that is provably stable is d...
متن کاملCost-effectiveness Analysis with Influence Diagrams.
BACKGROUND Cost-effectiveness analysis (CEA) is used increasingly in medicine to determine whether the health benefit of an intervention is worth the economic cost. Decision trees, the standard decision modeling technique for non-temporal domains, can only perform CEA for very small problems. OBJECTIVE To develop a method for CEA in problems involving several dozen variables. METHODS We exp...
متن کامل